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^ Abstract 

For n > 1 let Xn be a vector of n independent Bernoulli random 
^ variables. We assume that X„ consists of M "blocks" such that the 

,—1 Bernoulli random variables in block i have success probability pi . Here 

M does not depend on n and the size of each block is essentially linear 
Ph in n. Let Xn be a random vector having the conditional distribution 

4^ of Xn, conditioned on the total number of successes being at least 

where kn is also essentially linear in n. Define Yn similarly, but with 
^ success probabilities Qi > Pi. We prove that the law of Xn converges 

weakly to a distribution that we can describe precisely. We then prove 
^ that supF(Xn < Yn) converges to a constant, where the supremum 

K*" is taken over all possible couplings of Xn and 1^. This constant is 

^ expressed explicitly in terms of the parameters of the system. 
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^ 1 Introduction and main results 

> 

• ^ Let X and Y be random vectors on with respective laws jj, and u. We say 

^ that X is stochastically dominated by Y , and write X ^ 1^, if it is possible to 

d define random vectors U = {Ui, . . . , Un) and V = {Vi, . . . , V^) on a common 

probability space such the laws of U and V are equal to /i and u, respectively, 
and U < V (that is, Ui < Vi for alH e {1, ■ ■ ■ ,n}) with probability 1. In 
this case, we also write /i di J^- For instance, when X — {Xi, . . . and 
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Y = (Yi, . . . ,Yn) are vectors of n independent Bernoulli random variables 
with success probabilities pi, . . . ,p„ and gi, . . . , g„, respectively, and < Pi < 
Qi < 1 for i E {1, . . . , n}, we have X ^Y. 

In this paper, we consider the conditional laws of X and Y, conditioned 
on the total number of successes being at least k, or sometimes also equal 
to k, for an integer k. In this first section, we will state our main results and 
provide some intuition. All proofs are deferred to later sections. 

Domination issues concerning the conditional law of Bernoulli vectors 
conditioned on having at least a certain number of successes have come up 
in the literature a number of times. In [2] and a simplest case has been 
considered in which Pi = p and Qi = q for some p < q. In [ ], the conditional 
domination is used as a tool in the study of random trees. 

Here we study such domination issues in great detail and generality. The 
Bernoulli vectors we consider have the property that the Pi and take only 
finitely many values, uniformly in the length n of the vectors. The question 
about stochastic ordering of the corresponding conditional distributions gives 
rise to a number of intriguing questions which, as it turns out, can actually 
be answered. Our main result. Theorem 1.8, provides a complete answer to 
the question with what maximal probability two such conditioned Bernoulli 
vectors can be ordered in any coupling, when the length of the vectors tends 
to infinity. 

In Section 1.1, we will first discuss domination issues for finite vectors X 
and Y as above. In order to deal with domination issues as the length n of the 
vectors tends to infinity, it will be necessary to first discuss weak convergence 
of the conditional distribution of a single vector. Section 1.2 introduces 
the framework for dealing with vectors whose lengths tend to infinity, and 
Section 1.3 discusses their weak convergence. Finally, Section 1.4 deals with 
the asymptotic domination issue when n — i- oo. 

1.1 Stochastic domination of finite vectors 

As above, let X = (Xi, . . . and Y = (Yi, . . . , y„) be vectors of in- 
dependent Bernoulli random variables with success probabilities pi,...,pn 
and qi, . . . ,qn, respectively, where < j9j < < 1 for z G {1, . . . , n}. For 
an event A, we shall denote by C{X\A) the conditional law of X given A. 
Our first proposition states that the conditional law of the total number of 
successes of X, conditioned on the event {J2^=iXi > k}, is stochastically 
dominated by the conditional law of the total number of successes of Y. 

Proposition 1.1. For all k G {0, 1, ... , n}, 
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In general, the conditional law of the full vector X is not necessarily 
stochastically dominated by the conditional law of the vector Y. For exam- 
ple, consider the case n = 2, pi — p2 — qi — p and q2 — 1—p ior some p < \i 
and k = 1. We then have 

P(Xi = 1 I + > 1) ^ 



P{Yi = 1 I Fi + F2 > 1) 



2 -p' 
P 



l-{l-p)p 

Hence, if p is small enough, then the conditional law of X is not stochastically 
dominated by the conditional law of Y. 

We would first like to study under which conditions we do have stochastic 
ordering of the conditional laws of X and Y. For this, it turns out to be very 
useful to look at the conditional laws of X and V, conditioned on the total 
number of successes being exactly equal to k, for an integer k. Note that if we 
condition on the total number of successes being exactly equal to k, then the 
conditional law of X is stochastically dominated by the conditional law of Y 
if and only if the two conditional laws are equal. The following proposition 
characterizes stochastic ordering of the conditional laws of X and Y in this 
case. First we define, for i e {1, . . . , n}, 

ft := -P^i^. (1) 

1 - Pi qi 

The Pi will play a crucial role in the domination issue throughout the paper. 
Proposition 1.2. The following statements are equivalent: 

(i) All f3i (i E {I, . . . ,n}) are equal; 

(ii) L{X I Er=i ^^ = k)= C{Y\ E:Li y^ = k) for all A; G {0, 1, ... , n}; 

(iii) C{X\ Xi^k)^ C{Y\ Yi = k) for some A; e {1, . . . , n - 1}. 

Wc will use this result to prove the next proposition, which gives a suffi- 
cient condition under which the conditional law of X is stochastically dom- 
inated by the conditional law of 1^, in the case when we condition on the 
total number of successes being at least k. 

Proposition 1.3. // all (5i (i G {!,..., n}^ are equal, then for all k G 
{0,1,..., n}, 

^(^1 Er=i^. > ^) ^ Tr^=ly^ > k). 

The condition in this proposition is a sufficient condition, not a necessary 
condition. For example, if n = 2, pi = p2 = |, ?i = ^5 and ^2 = then 
/3i 7^ (^2: but we do have stochastic ordering for all k G {0, 1, 2}. 
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1.2 Framework for asymptotic domination 

Suppose that we now extend our Bernoulli random vectors X and Y to in- 
finite sequences Xi, X2, . . . and Yi,Y2, . . . of independent Bernoulli random 
variables, which we assume to have only finitely many distinct success prob- 
abilities. It then seems natural to let Xn and Yn denote the n-dimensional 
vectors (Xi, . . . ,X„) and (Yi, . . . ,Yn), respectively, and consider the domi- 
nation issue as n — )■ 00, where we condition on the total number of successes 
being at least /c„ = [an\ for some fixed number a G (0, 1). 

More precisely, with /c„ as above, let Xn be a random vector having the 
law C{Xn\ Y17=i-^i — and define Y^ similarly. Proposition 1.3 gives a 
sufficient condition under which X„ is stochastically dominated by Y^ for 
each n > 1. If this condition is not fulfilled, however, we might still be able 
to define random vectors U and V, with the same laws as X„ and 1^, on 
a common probability space such that the probability that U <V is high 
(perhaps even 1). We denote by 

supP(X„<l"„) (2) 

the supremum over all possible couplings {U , V) of (X^, Yn) of the prob- 
ability that U < V . We want to study the asymptotic behaviour of this 
quantity as n — )■ 00. 

As an example (and an appetizer for what is to come), consider the follow- 
ing situation. For 2 > 1 let the random variable Xi have success probability p 
for some p G (0, ^). Fori > lodd or even let the random variable Yi have suc- 
cess probability p or 1 — p, respectively. We will prove that supP(X„ < Yn) 
converges to a constant as n — )■ 00 (Theorem 1.8 below). It turns out that 
there are three possible values of the limit, depending on the value of a: 

(i) If a < p, then supP(X„ < Yn) 1- 

(ii) If a = p, then supP(X„ < Yn) f- 

(iii) If a > p, then supP(X„ < Yn) ^ 0. 

In fact, to study the asymptotic domination issue, we will work in an even 
more general framework, which we shall describe now. For every n > 1, Xn 
is a vector of n independent Bernoulli random variables. We assume that this 
vector is organized in M "blocks" , such that all Bernoulli variables in block i 
have the same success probability pj, for i G {1, . . . ,M}. Similarly, Yn is 
a vector of n independent Bernoulli random variables with the exact same 
block structure as X„, but for 1^, the success probability corresponding to 
block i is gi, where < p, < < 1 as before. 
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For given n > 1 and i G {1, . . . , M}, we denote by rriin the size of block i, 
where of course Yli=i '^in = ^- the example above, there were two blocks, 
each containing (roughly) one half of the Bernoulli variables, and the size of 
each block was increasing with n. In the general framework, we only assume 
that the fractions rriin/n converge to some number ai e (0,1) as n — )■ oo, 
where «i = 1- Similarly, in the example above we conditioned on the 
total number of successes being at least where A;„ = \_an\ for some fixed 
a G (0,1). In the general framework, we only assume that we are given 
a fixed sequence of integers such that < /c„ < n for all n > 1 and 
kn/n — )■ a G (0, 1) as n — )■ oo. 

In this general framework, let X„ be a random vector having the con- 
ditional distribution of X„, conditioned on the total number of successes 
being at least Observe that given the number of successes in a particular 
block, these successes are uniformly distributed within the block. Hence, the 
distribution of Xn is completely determined by the distribution of the M- 
dimensional vector describing the numbers of successes per block. Therefore, 
before we proceed to study the asymptotic behaviour of the quantity (2), we 
shall first study the asymptotic behaviour of this M-dimensional vector. 



1.3 Weak convergence 

Consider the general framework introduced in the previous section. We de- 
fine Xin as the number of successes of the vector X„ in block i and write 
S„ := Yl!iLi -^iri for the total number of successes in X„. Then Xj„ has a bi- 
nomial distribution with parameters min and pi and, for fixed n, the Xm are 
independent. In this section, we shall study the joint convergence in distri- 
bution of the Xin as n — 7- oo, conditioned on {S„ > and also conditioned 
on {E„ = kn}. 

First we consider the case where we condition on {S„ = We will 

prove (Lemma 3.1 below) that the Xj„ concentrate around the values Cj^mj^, 
where the Cj„ are determined by the system of equations 

l^^ = l^^ Vz,jG{l,...,M}; 

Cin I- Pi Cjn 1 - Pj (3) 

We will show in Section 3 that the system (3) has a unique solution and that 

Cm Ci as — > OO, 

for some Cj strictly between and 1. As we shall see, each component Xj„ 
is roughly normally distributed around the central value Cmmj„, with fluc- 
tuations around this centre of the order y^. Hence, the proper scaling is 
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obtained by looking at the M-dimensional vector 



^ _ I — CinfUin — C2rjn2n ^ Mn — CMn'ITT'Mn] /,s 



Since we condition on {E„ = kn}, this vector is essentially an (M — 1)- 
dimensional vector, taking only values in the hyperplane 

So:^{izi,...,ZM) eR'^: zi + ■ ■ ■ + zm ^ 0}. 

However, we want to view it as an M-dimensional vector, mainly because 
when we later condition on {S„ > kn}, Xn will no longer be restricted to 
a hyperplane. One expects that the laws of the Xn converge weakly to a 
distribution which concentrates on 5*0 and is, therefore, singular with respect 
to M-dimensional Lebesgue measure. To facilitate this, it is natural to define 
a measure vq on the Borel sets of through 

i/o(-)-Ao( - nSo), (5) 

where Aq denotes ((M — l)-dimensional) Lebesgue measure on 5*0, and to 
identify the weak limit of the Xn via a density with respect to vq. The 
density of the weak limit is given by the function / : — > R defined by 

zf 



=1 



/w=i.,wnoxp(-^^j(^). (6) 



Theorem 1.4. The laws >C(Ar„|E„ = kn) converge weakly to the measure 
which has density f / J f dvo with respect to uq. 

We now turn to the case where we condition on {E„ > kn}- Our strategy 
will be to first study the case where we condition on the event {E„ = kn+i}, 
for £ > 0, and then sum over £. We will calculate the relevant range of i to 
sum over. In particular, we will show that for large enough i the probability 
P(E„ = kn + i) is so small, that these £ do not have a significant effect on the 
conditional distribution of Xn- For kn sufficiently larger than E(E„), only £ 
of order o(i/n) are relevant, which leads to the following result: 

Theorem 1.5. If a > J^fLiPi^i or, more generally, (A;„ — E(E„))/y^ oo, 
then the laws £(^„|E„ > kn) also converge weakly to the measure which has 
density f/ J f dvo with respect to vq. 

Finally, we consider the case where we condition on {E„ > with kn 
below or around E(E„), that is, when {kn — E(E„))/y^ — >■ X e [—00,00). 
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An essential difference compared to the situation in Theorem 1.5, is that the 
probabihties of the events {S„ > A;„} do not converge to in this case, but 
to a strictly positive constant. In this situation, the right vector to look at 
is the M-dimensional vector 



It follows from standard arguments that the unconditional laws of AT^ con- 
verge weakly to a multivariate normal distribution with density h/ j hdX 
with respect to M-dimensional Lebesgue measure A, where h: — ?• IR is 
given by 



If kn stays sufficiently smaller than E(S„), that is, when K = — oo, then the 
effect of conditioning vanishes in the limit, and the conditional laws of 
given {S„ > kn} converge weakly to the same limit as the unconditional 
laws of X^. In general, if G [—00, 00), the conditional laws of X^ given 
{S„ > kn} converge weakly to the measure which has, up to a normalizing 
constant, density h restricted to the half-space 



Theorem 1.6. // {kn — E(Il„))/-\/ra — )• K for some K E [—00,00), then 
the laws > converge weakly to the measure which has density 

h Ihi^ / J h Ihk d\ with respect to X. 

Remark 1.7. If {kn — E(S„))/y^ does not converge as n — i- 00 and does not 
diverge to either 00 or —00, then the laws £(Ar^|S„ > kn) do not converge 
weakly either. This follows from our results above by considering limits along 
different subsequences of the kn- 

1.4 Asymptotic stochastic domination 

Consider again the general framework for vectors Xn and Yn introduced 
in Section 1.2. Recall that we write Xn ior a random vector having the 
conditional distribution of the vector Xn, given that the total number of 
successes is at least kn- For n > 1 and i G {1, . . . , M}, we let denote the 
number of successes of X„ in block i. We define Yn and Ym analogously. We 
want to study the asymptotic behaviour as n — > 00 of the quantity 





,...,Zm) eR^': Zi + --- + ZM> K}. 
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SUpP(X„ < Yn), 
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where the supremum is taken over all possible couplings of X„ and Yn- 

Define /3j for i G {1, . . . ,M} as in (1). As a first observation, note that 
if all Pi are equal, then by Proposition 1.3 we have supP(X„ < Yn) = 1 
for every n > 1. Otherwise, under certain conditions on the sequence kn, 
supP(X„ < Yn) will converge to a constant as n — )• oo, as we shall prove. 
The intuitive picture behind this is as follows. Without conditioning, 
^ Yn for every n > 1. Now, as long as kn stays significantly smaller 
than E(E„), the effect of conditioning will vanish in the limit, and hence we 
can expect that supP(X„ < Y^) — >■ 1 as n — )■ cx). Suppose now that we 
start making the kn larger. This will increase the number of successes Xin 
of the vector X„ in each block i, but as long as fc„ stays below the expected 
total number of successes of Y^, increasing fc„ will not change the numbers 
of successes per block significantly for the vector y„. 

At some point, when kn becomes large enough, there will be a block i 
such that Xin becomes roughly equal to Yin- We shall see that this happens 
for kn "around" the value kn defined by 

M 



kn ■- 



^Pi + /3max(l -Pi) 

where ;9max := iiiax{/3i, . . . , /^Af }. Therefore, the sequence kn will play a 
key role in our main result. What will happen is that as long as kn stays 
significantly smaller than Xj„ stays significantly smaller than Yin for each 
block z, and hence supP(X„ < Yn) — ?■ 1 as n — )■ oo. For kn around kn 
there is a "critical window" in which interesting things occur. Namely, when 
{kn — kn)/\/n converges to a finite constant supP(X„ < Yn) converges 
to a constant Pk which is strictly between and 1. Finally, when kn is 
sufficiently larger than A;„, there will always be a block i such that Xj„ is 
significantly larger than Yin- Hence, supP(X„ < Yn) — )■ in this case. 

Before we state our main theorem which makes this picture precise, 
let us first define the non-trivial constant Pk which occurs as the limit of 
supP(X„ < Yn) when kn is in the critical window. To this end, let 

/:={zG{l,...,M}: A = /3^ax}, 
and define positive numbers a, h and c by 



a' 



/3maxPi(l -Pi)0ti 



fe + /3max(l-p.))2' ^^^^ 

= + 6l (9c) 
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As we shall see later, these numbers will come up as variances of certain 
normal distributions. Let R — > (0, 1) denote the distribution function of 
the standard normal distribution. For K define Pk by 



K 



(10) 















\ ac 


a J 





K b 

— + - 
a ac 



+ —Rk) a a > YlfLiPiOii- 



where Rk = \/K'^ + (? log(c2/6^). It will be made clear in Section 4 where 
these formulas for Pj^ come from. We will show that Pk is strictly between 
and 1. In fact, it is possible to show that both expressions for Pk are strictly 
decreasing in K from 1 to 0, but we omit the (somewhat lengthy) derivation 
of this fact here. 

Theorem 1.8. // all {% G {l,...,M}j are equal, then we have that 
supP(X„ < Yn) = 1 for every n > 1. Otherwise, the following holds: 

(i) If {kn — kn)/ \/n — 7- — oo, then supP(X„ < 1^,) — )• 1. 

(ii) If {kn — kn)/\/n — t- K for some K G R, then supP(X„ < Y^) Pk- 
(Hi) If {kn — kn)l\pn —7- oo, then supP(X„ < Y^) — )■ 0. 



Remark 1.9. If /3j 7^ (ij for some « 7^ j, and [kn — kn)/^/n does not converge 
as n — )■ 00 and does not diverge to either 00 or —00, then supP(X„ < Yn) 
does not converge either. This follows from the strict monotonicity of Pk-, 
by considering the limits along different subsequences of the kn- 

To demonstrate Theorem 1.8, recall the example from Section 1.2. Here 
/3max = ^, kn = pu, I = {1} and a^ = b'^ = |p(l — p). If a = p, then we have 
that {kn — kn) I \fn — > as n — )• oo. Hence, by Theorem 1.8, sup P(X„ < Yn) 
converges to 



/•O .2/2 3 

1-2/ -^{^{-z)-\l2)dz = - 



In fact. Theorem 1.8 shows that we can obtain any value between and 1 
for the limit by adding yK^/n\ successes to for G P. 

Next we turn to the proofs of our results. Results in Section 1.1 are 
proved in Section 2, results in Section 1.3 are proved in Section 3 and finally, 
results in Section 1.4 are proved in Section 4. 
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2 Stochastic domination of finite vectors 

Let X = (Xi,...,X„) and Y = be vectors of independent 

Bernoulli random variables with success probabilities pi, . . . , p„ and qi, . . . ,qn 
respectively, where < pi < qi < 1 for i E {I, n} . 

Suppose that Pi = p for all i. Then ^"^^^ Xi has a binomial distribution 
with parameters n and p. The quotient 

P(Er=i^^ = ^ + l) _n-k p 



PiEtiX^ = k) k + ll-p 

is strictly increasing in p and strictly decreasing in k, and it is also easy to 
see that 

-^(^1 EtiXi = k)^ C{X\ Y.l=lX^ = k + l). 

The following two lemmas show that these two properties hold for general 
success probabilities pi, . . . 

Lemma 2.1. For A; G {0, 1, . . . , n — 1}, consider the quotients 

^n._ P(Er=i^. = fc + i) 

and 

iJoi/i (11) and (12) are strictly increasing inpi, . . . ,pn for fixed k, and strictly 
decreasing in k for fixed pi, . . . ,p„. 

Proof. We only give the proof for (11), since the proof for (12) is similar. 
First we will prove that is strictly increasing in pi, . . . for fixed k. By 
symmetry, it suffices to show that is strictly increasing in pi. We show 
this by induction on n. The base case n = 1, /c = is immediate. Next note 
that for n>2 and k G {0, . . . , — 1}, 



p(Er=i' X, = k)p^ + p(Er=7 X, = A; + i)(i - p„) 



p(Er=i = - IK + p(Er=i = - Pn) 



p.+Qri(i-p„) 



which is strictly increasing in pi by the induction hypothesis (in the case 
/c = n — 1, use Q^~^ = 0, and in the case k = 0, use l/Q^zl = 0). 
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To prove that is strictly decreasing in k for fixed pi, . . . ,Pn, note that 
since is strictly increasing in p„ for fixed A; G {1, . . . , n — 2}, we have 



< 



dpn ' dpnPn/Qt-\ + {l-Pn) (p„/Q^:l + (1 - ' 



Hence, Q'^ < Q^_i- This argument applies for any n>2. □ 

Let = {X^, . . . , X^) have the conditional law of X, conditioned on the 
event {Yli=i-^i = Our next lemma gives an explicit coupling of the X^ 
in which they are ordered. The existence of such a coupling was already 
proved in [1, Proposition 6.2], but our explicit construction is new and of 
independent value. In our construction, we freely regard X'' as a random 
subset of {1, . . . , n} by identifying X^ with {i G {1, . . . , n} : Xf = 1}. For 
any K C {1, . . . , n}, let {Xa- = 1} denote the event {Xj = 1 Vi G K}, and 
for any / C {1, . . . , n} and j G {!,..., n}, define 



Lc{l,...,n}: |L|=|/|+1 



Lemma 2.2. For ani/ / C {!,..., n}, t/ie collection {7j,/}jg{i,...,n}\/ o 
probability vector. Moreover, if I is picked according to X^ and then j is 
picked according to {7j,/}je{i,...,n}\/; ^he resulting set J = {/, j} has the same 
distribution as if it was picked according to X'^"'"^. Therefore, we can couple 
the sequence {X'^j^^i such that F{X^ < X^ < ■ ■ ■ < X"-i < X") = 1. 

Proof. Throughout the proof, /, J, K and L denote subsets of {!,... ,n}, 
and we simplify notation by writing S„ := X]j"=i^«- First observe that 

5^7,-,= Yl P(^L = 1|S„ = |/| + 1) = 1, 

j0 L: \L\=\I\ + 1 

which proves that the {7j,/}j^/ form a probability vector, since 'jjj > 0. 
Next note that for any K containing j, 



F{Xk = 1 


1 — 


\K\) _ P(X, = 1) P(S„ = 


\K 


1-1) 


P{Xk\{,} = 1 1 


= 1 


K\ 


-1) P(X, = 0) P(E„ = 




^1) 



(13) 

Now fix J, and for j G J, let / = /(j, J) = J\ {j}. Then for j G J, by (13), 

P(X, = 1|S„, = |J|) ^ ijjeL) 
= p(x, = i|s„ = |/|)^^Z.|^|^lW'^^^^^^^-^ = = '"^ 

i^:|X| = |7| I \ I 



P(Xj = 1 


s„ = 


1^1) 


P(X, = 1 


1 5]„ = 


l/l) 
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where the second equahty follows upon writing K = L \ {j}, and using 
|L \ /| = \L\ J\ + 1 = \K \ J\ -\- 1 = I J \ f^l in the sum. Hence, by summing 
first over j and then over K, we obtain 

^7,-,P(X, = 1 I S„ = |/|) = F{Xj = 1 I S„ = \J\). □ 
Corollary 2.3. For k & {0,1, ... ,n — 1} we have 

^(^1 Etl^r >k)^ C{X\ Y.l=lX^ >k + l). 

Proof. Using Lemma 2.2, we will construct random vectors U and V on 
a common probability space such that U and V have the conditional dis- 
tributions of X given {^21=1^1 — ^} X given > k + 1}, 
respectively, and U <V with probability 1. 

First pick an integer m according to the conditional law of Y17=i given 
{'n^=iXi — fTi > k + 1, then pick U according to the conditional 

law of X given {Yl^=i = ^ind set V = U. If m = k, then first 
pick an integer m + i according to the conditional law of Y17=i -^i given 
{Sr=i"^* — k + 1}. Next, pick U and V such that U and V have the 
conditional laws of X given {X]r=i -^i ~ ""^i given {Y^^=i = rn+i}, 

respectively, and U < V. This is possible by Lemma 2.2. By construction, 
U < V with probability 1, and a little computation shows that U and V 
have the desired marginal distributions. □ 

Now we are in a position to prove Propositions 1.1, 1.2 and 1.3. 

Proof of Proposition 1.1. By Lemma 2.1 we have that for C. & {1, . . . ,n — k}, 

p(Er=i X. > fc + £) _ ^ p(Er=i x,>k+3 + i) 

is strictly increasing in pi, . . . ,Pn- This implies that for £ G {1, . . . , n — A;}, 

Proof of Proposition 1.2. Let x,y E {0,1}" be such that J2i=i^i — SiLi ^« 
and let k = Y17=i^i- Write / = {i G {1, . . . ,n} : Xi = 1} and, likewise, 
J = {i E {1, . . . , n} : Hi = 1} , and recall the definition (1) of We have 

PiX = x\j:t,X, = k) _ UieiP^UiiM-P^) 



PiX = y I j:tlX, = k) Y{^^JP^Y[^iA^-P^) 

n Pi TT ^-Pi ^ n»gj\j (3iP(Y = x\ Y.l=i = k) , . 



i&i\j i&j\i 
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Since |/| = \ J\ = k, we have |/ \ J| = \J \ I\- Hence, (i) implies (ii), 
and (ii) trivially implies (iii). To show that (iii) implies (i), suppose that 
-^^(^1 Eti X^ = k) = CiY\ Zti Yi = k) for a given e {1, . . . , n - 1}. Let 
i G {2, . . . , n} and let K he a. subset of {2, . . . , n} \ {i} with exactly /c — 1 
elements. Choosing I = {1}UK and J = U {i} in (14) yields A = □ 

Proof of Proposition 1.3. By Proposition 1.2 and Lemma 2.2, we have for 
m e {0, 1, . . . , n} and £ G {0, 1, . . . , n — m} 



Using this result and Proposition 1.1, we will construct random vectors U 
and V on a common probability space such that U and V have the condi- 
tional distributions of X given {Y^^=i ^ ^} ^-iid Y given {Y^^=i Yi > A;}, 
respectively, and U <V with probability 1. 

First, pick integers m and m + I such that they have the conditional 



laws of YJi=iXi given {YA=i^i > ^} and YJi=i^i given {Yl'l=i^i > k}, 



respectively, and m < m + ^ with probability 1. Secondly, pick U and V 
such that they have the conditional laws of X given {^"^j^ Xi = m} and Y 
given {X]r=i ^ = m + £}, respectively, and U < V with probability 1. A 
little computation shows that the vectors U and V have the desired marginal 



We close this section with a minor result, which gives a condition under 
which we do not have stochastic ordering. 

Proposition 2.4. If pi = qi for some i G {1, . . . ,n} hut not for all i, then 
for k e {l,...,n- 1}, 



Y.ti X^ = m)^ C{Y\ Y.ti Y, = m + l). 



distributions. 



□ 




qn- We have 



P(^n = 1 



PnnY.l=lXi>k-l) 



Pn P(Er=i' X, > A; - 1) + (1 - p„) nY^iZl X, > k) 



Pn 



Pn + il- Pn) P(Er=l' Xi > k)/ P(Er="l' X,>k-1) 



> 



g„ + (1 - qn) P(Er=i' > k)/ P(Er=i' Y.>k-1) 




□ 
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3 Weak convergence 



We now turn to the framework for asymptotic domination described in Sec- 
tion 1.2 and to the setting of Section 1.3. Recall that Xj„ is the number of 
successes of the vector X„ in block i. We want to study the joint convergence 
in distribution of the Xj„ as n — )■ oo, conditioned on {S„ > and also 
conditioned on {S„ = A;„}. Since we are interested in the limit n — )■ oo, we 
may assume from the outset that the values of n we consider are so large that 
kn and all strictly between and n, to avoid degenerate situations. 

We will first consider the case where we condition on the event {S„ = kn}- 
Lemma 3.1 below states that the Xj„ will then concentrate around the values 
Cinruin, where the Cj„ are determined by the system of equations (3), which 
we repeat here for the convenience of the reader: 

1^-^ = 1^-^ V.,JG{1,...,M}; 

Cin I- Pi Cjn 1 - Pj (3) 

EM _ 7, 

Before we turn to the proof of this concentration result, let us first look at 
the system (3) in more detail. If we write 

A„ = ^^-^ (15) 

Cin J- Pi 

for the desired common value for all z, then 

Pi 



Pi + An{l -Pi)' 



Note that this is equal to 1 for yl„ = and to pi for y4„ = 1, and strictly 
decreasing to as — )■ oo, so that there is a unique > such that 



M M 

Pim 



CinTTlin — , A n \ — ^n- (16) 

Pi + An{^ - Pi) 



i=l 



It follows that the system (3) does have a unique solution, characterized 
by this value of An- Moreover, it follows from (16) that if kn > ]E(S„) = 
'^^iPiTriin, then A„ < 1. Furthermore, kn/n — )■ a and rriin/n — )■ Oj. Hence, 
by dividing both sides in (16) by n, and taking the limit n — > oo, we see that 
the An converge to the unique positive number A such that 

M 

\- Pidj _ 
j^^Pi + A{l-pi) 
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where A = 1 ii a = '^fLiPiCti. As a consequence, we also have that 

Pi ^ 
Cin Ci = r as n — )■ oo. 

' p, + A{1 - Pi) 
Note that the q are the unique solution to the system of equations 

— T^ = ^T^ V.,JG{1,...,M}; 

Ci I- Pi Cj 1 - Pj 

/ jj—i C-iO-i Q.. 

Observe also that q = Pi in case A = 1, or equivalently YlitLiPi'^i = ci?, 
which is the case when the total number of successes kn is within o{n) of the 
mean E(Sn). The concentration result: 

Lemma 3.1. Let ci„, . . . ,CMn satisfy (3). Then for each i and all positive 
integers r, we have that 

P(|X,„ - Cmm,J > Mr I E„ = K) < 2Me-^^-'>"/\ 

Proof. The idea of the proof is as follows. Condition on {S„ = kn}, and 
consider the event that for some i ^ j we have that Xj„ = Cinrriin + s, and 
Xjn = Cjnrrijn — t, for some positive numbers s and t. We will show that if 
the Cin satisfy (3), the event obtained by increasing Xj„ by 1 and decreasing 
Xjn by 1 has smaller probability. This establishes that the conditional dis- 
tribution of the Xin is maximal at the central values Cinrriin identified by the 
system (3). The precise bound in Lemma 3.1 also follows from the argument. 

Now for the details. Let s and t be nonnegative real numbers such that 
Cinf^in + s and Cjnrrijn — t are integers. By the binomial distributions of Xin 
and Xjn and their independence, if it is the case that < CinTriin + s < 
and < CjnTTLjn — t < rrijn, then 

P(^Xin CinlTlin -|- S -|- 1, Xjn CjnTfljn ^ 1) 
lP(X^m Cinrriin ~t~ S, Xjn Cjn^T^jn ^) 

rflin Cinrflin ^ Pi \ / Cjn'Tfljn ^ 1 Pj 



Cin^in + S+1 1 -Pij \ nijn - Cjnnijn + t + l Pj 
^^in Cinrriin S Pi \ I Cjn'Trijn t 1 Pj 



< 

CinTfliyi 1 Pi / \^jn C^jnX^jn Pj 

Hence, if the Cin satisfy (3), then using 1 — z < exp(— z) we obtain 
< ( 1 111 ^ 1 < exp 
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It follows by iteration of this inequality, that for all real s,t > and all 
integers m > 0, 

lP(^m ^in^in S -\- U, Xj^i Cj^f^jri ^ ^) 

< exp (^~"~~~^ P(-^m = CinTriin + S, Xj^ = CjnrUjn - t) . (17) 

Now fix i, and observe that for all integers r > 0, 

P{Xin > CinTriin + Mr, S„ = /;;„) 

XI ^ - + Mr ) P (Xfc„ = £fc Vfc) . 

^i,.../A/eINo: 

But if £i + ■ ■ ■ + = kn and £j > Ciniriin + Mr, then there must be some 
j ^ i such that ij < Cjnrrijn — r. Therefore, 

P{Xin > Cinfriin + Mr, S„ = /c„) 
M 

3=1 £i,.../Me]No: 

By independence of the Xj„ and using (17) with s = (M — l)r, t = and 
M = r, we now obtain 



P{Xin > CinTriin + Mr, S„ = /c„) 



i=l £i,...A/6lNo: 

r>2 / 



< Me-(*^-^)^ P(S„ = fc„). 

This proves that 

P(X,„ > c^nruin + Mr I = A;,) < Me-^^"^)^'/". 

Similarly, one can prove that 

P(X,„ < Q„m„ - Mr I S„ = A;„) < Me-^^^-^)^'/". □ 

As we have already mentioned, we expect that the X^ have fluctuations 
around their centres of the order y/n. It is therefore natural to look at the 
M-dimensional vector 

^ / Xln — Xin X2n — X2n Xmu ~ ^Mn \ 

:- ^ , ^ ^ , l-Loj 
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Figure 1: The shear transformation a (illustrated here for M = 2) maps 
sheared cubes to cubes. The dots are the sites of the integer lattice 1? . The 
gray band on the left encompasses those sheared cubes that intersect S^. 

where the vector a;„ = . . . , x^n) represents the centre around which 
the concentrate. To prove weak convergence of X^-, we will not set Xin 
equal to the latter numbers are not necessarily integer, and 

it will be more convenient if the Xin are integers. So instead, for each fixed n, 
we choose the Xin to be nonnegative integers such that — Cj„^m| < 1 for 
all 2, and ^^i^^m = ^n- Of course, the vector Xn as it is defined in (18), 
and the vector defined in (4) have the same weak limit. In our proofs of 
Theorems 1.4 and 1.5, Xn will refer to the vector defined in (18). 

If we condition on {E„ = /c„}, then the vector Xn will only take values 
in the hyperplane 

So := {{zi, ...,zm) eR^': ZI + --- + ZM = 0}. 

However, as we have already explained in the introduction, we still regard Xn 
as an M-dimensional vector, because we will also condition on {E„ > /c„}, 
in which case Xn is not restricted to a hyperplane. To deal with this, it 
turns out that for technical reasons which will become clear later, it is useful 
to introduce the projection vr: {zi, . . . , zm) (-Zi, • • • , zm-i) and the shear 
transformation a: (zi, . . . , Zm) ^ (^^i, . . . , zm-i,Zi + ■ — h Zm)- We can then 
define a metric p on by setting p{x,y) := \ax — ay\, where |-| denotes 
Euclidean distance. See Figure 1 for an illustration. 

Using the projection vr, we now define a new measure po on the Borel 
subsets of R^, which is concentrated on 5*0, by 

/xo(-):= x''-\n{- nSo)), 

where A^^~^ is the ordinary Lebesgue measure on ]R*^~^. Note that up to a 
multiplicative constant, po is equal to the measure defined in Section 3, so 
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we could have stated Theorems 1.4 and 1.5 equally well with /io instead of uq. 
In the proofs it turns out to be more convenient to work with /ig, however, 
so that is what we shall do. 

Our proofs of Theorems 1.4 and 1.5 resemble classical arguments to prove 
weak convergence of random vectors living on a lattice via a local limit the- 
orem and Scheffe's theorem, see for instance [1, Theorem 3.3]. However, 
we cannot use these classic results here, for two reasons. First of all, in 
Theorem 1.5 our random vectors live on an M-dimensional lattice, but in 
the limit all the mass collapses onto a lower- dimensional hyperplane, leading 
to a weak limit which is singular with respect to M-dimensional Lebesgue 
measure. The classic arguments do not cover this case of a singular limit. 

Secondly, we are considering conditioned random vectors, for which it is 
not so obvious how to obtain a local limit theorem directly. Our solution is to 
get rid of the conditioning by considering ratios of conditioned probabilities, 
and prove a local limit theorem for these ratios. An extra argument will 
then be needed to prove weak convergence. Since we cannot resort to classic 
arguments here, we have to go through the proofs in considerable detail. 



3.1 Proof of Theorem 1.4 

As we have explained above, the key idea in the proof of Theorem 1.4 is that 
we can get rid of the awkward conditioning by considering ratios of con- 
ditional probabilities, rather than the conditional probabilities themselves. 
Thus, we will be dealing with ratios of binomial probabilities, and the follow- 
ing lemma addresses the key properties of these ratios needed in the proof. 
The lemma resembles standard bounds on binomial probabilities, but we 
point out that here we are considering ratios of binomial probabilities which 
centre around Cinrriin rather than around the mean Pirriin- We also note that 
actually, the lemma is stronger than required to prove Theorem 1.4, but we 
will need this stronger result to prove Theorem 1.5 later. 

Lemma 3.2. Recall the definition (15) of An- Fix i G {1, 2, . . . , M} and let 
6i, ^2, • • • be a sequence of positive integers such that bn/ \/n — )■ as n ^ oo. 
Then, for every z eH, 



sup 

X '. \x 3^ in I ^^Ti 

r: \r-zy/n\<bn 



1 F{Xin = x + r) ^ -2 

exp 



A;; F{Xin = x) \ 2ci(l - Ci)a., 



0. 



Furthermore, there exist constants B], Bf < oo such that for all n and r. 
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Proof. Robbins' note on Stirling's formula [ ] states that for all m = 1, 2, 



from which it is straightforward to show that for all m = 0, 1, 2, . . . (so 
including m = 0), there exists an rjm satisfying 1/7 < rj^ < 1/5 such that 



where we have introduced the notation |m] := m + rjm. 

Since Xj„ has the binomial distribution with parameters rriin and pi, 

1 P{Xi„ = X + r) x\ (mj„-x)! / 



(19) 



A;; F{Xin = x) {x + r)\ {rriin - x - r)\ \1 - Cin 
Using (19), we can write this as the product of the three factors 



Pl{x.r\ 



mi„, - X 



{x + r| Iniin - x-r 

x~\-r 



1/2 



X 



X + r 



X 



for all X and r such that < a; < mj„ and < x + r < rrii 
To study the convergence of Pin{x, r), first write 



1 - 



r 



X + r 



x+r 



1 + 



ITLir,. - X-r 



Using the fact that for all u > —1, (1 + u) lies between exp( 



1 2 



) and 



exp(ti— ^u^/ (l + u)) , a little computation now shows that Pj^ni^, r) is wedged 
in between 



exp 



[mi 



\ 2 



2 x{min — X — r) 



and exp 



1 {rriin + 

2 (x + r){min - x) ) ' 



From this fact, it follows that for fixed 2; G R, 



sup 

r: \r-z^\<bn 



Pfn{x,r) - exp 



2c,; (1 - c,:)a,: 



^0, 



19 



— )■ Q, hence x = Cirriin + o{n) and r = z^/n + o[^/n) under 
the supremum, and irtin/n — )■ ctj. Since — Cinmin\ < 1, we also have that 

sup |Pj^„(x, r) — l| — and sup |-P^(x, r) — l| — )■ 0. 

3;: Xiji\<ibn X'. \x Xiji\<^b'fi 

r: \r—z^\<bn r: \r—z^\<b„ 

Together with the uniform convergence of -Pj'^(a;, r), this estabhshes the first 
part of Lemma 3.2. 

We now turn to the second part of the lemma. If x and r are such 
that < X < rriin and < x + r < rriin, then mj„ — r > a; > and 
TTT-in + r > rriin — X > 0, hence from the bounds on Pj^^{x,r) given in the 
previous paragraph we can conclude that 

, , , / 1 r2 \ / 1 r2 

Pin{x,r) < exp < exp -- — 

V 2 rriin J \ 2n 

Next observe that if x is such that \x — Xin\ < bn, then \x — Cinfriinl < 1 + 
from which it follows that uniformly in n, for all x and r such that < x < 
rriin, < X + r < rriin and \x - Xj„| < 6„, 

Pf„{x,r) < I 1 + const, x — | < exp ( const, x —= | . 

To finish the proof, it remains to bound Pj^(x,r). To this end, observe 
first that uniformly in ra, for all x and r such that |x— x^l < 6„ and |r| < n^/^, 
Pj^(x,r) is bounded by a constant. On the other hand, uniformly for all x 
and r such that < x < mj„ and < x + r < rriin, PLi^y ^) bounded by a 
constant times n, and n < /ri^ if |r| > n'^^^. Combining these observations, 
we see that uniformly in n, for all x and r satisfying |x — Xi„| < 6„ and 
< X + r < mj„, 

Pl^{x, r) < const, x + . □ 

Proof of Theorem 1.4- For a point 2; in R^, let \z\ be the point in 
p-closest to 2; (take the lexicographically smallest one if there is a choice). 
Graphically, this means that the collection of those points z for which \z\ = a 
comprises the sheared cube a + a~^{— 1/2,1/2]^ , see Figure 1. Now, for each 
fixed z G R*^, set = (rf„, . . . , r|^„) := [^v^J • Observe that because (for 
fixed n) the x^ sum to kn, if G 5*0 we have that 

Fj^Xn = I S„ = kn) ^ PiV^Xn = ^n) ^ TT ^(^m = Xin + rfj 

P(v/^A'„ = I S„ = A;„) P(v/^Ar„ = 0) P(Xi„ = Xi„) ' 

(20) 
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where we have used the independence of the components Xin. If ^ 5*0, on 
the other hand, this ratio obviously vanishes. 

We now apply Lemma 3.2 to (20), taking 6„ = M for every n > 1. Since 

"^iLi ^in = if ^ 'S'o and hence Hi^i — 1; ^^^^ P^^^ of Lemma 3.2 
immediately implies that for all z G R*^, 

P(v^.*-„ = < I £„ = *•„) , , .-j^ ( -I ^ «i 



as n — 7- oo. To see how this will lead to Theorem 1.4, define /„ : R*^ — )■ R by 

/„(z) := (v^)^^P(v/^A'„ = < I S„ = 

Then is a probability density function with respect to M-dimensional 
Lebesgue measure A. Moreover, if 2^ is a random vector with this density, 
then the vector 2'^ = \Zn\fn\j ^fn has the same distribution as the vec- 
tor conditioned on {S„ = A;„}. Since clearly Zn and Z'^ must have the 
same weak limit, it is therefore sufficient to show that the weak limit of Zn 
has density /// /rf/xo with respect to 

Now, by what we have established above, we already know that 







1 




P( 




^71 





/„(0) P(vn^„ = I s„ = fc„) 

Moreover, the second part of Lemma 3.2 applied to (20) shows that the ra- 
tios fn{z)/fn{0) are uniformly bounded by some /io-integrable function g{z). 
Thus it follows by dominated convergence that for every Borel set A C R^, 

fn{z) 



dfio{z) / f{z) dfio{z). 

J A 



/n(0) 

Next observe that 1 = J fndX = J rT^I'^ d^o, because by the condition- 
ing, /„ is nonzero only on the sheared cubes which intersect 5*0. Therefore 
taking A = in the previous equation yields n~^/^/„(0) — (/ f dno)~^ 
which in turn implies that for every Borel set A, 

J^f{z) dfioiz) 



n ^^^fn{z)dfio{z) 



I fdfio 

In general, Jp, fn d\ ^ jp n'^^'^fn dfiQ for an arbitrary Borel set F, but we 
have equality here for sufficiently large n if F is a finite union of sheared 
cubes. Hence, if A is open, we can approximate A from the inside by unions 
of sheared cubes contained in A to conclude that 

hmmf / fn{z)dX{z) > tt— . □ 
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3.2 Proof of Theorem 1.5 

We now turn to the case where we condition on {S„ > A;„}, for the same 
fixed sequence /c„ — )■ oo as before. To treat this case, we are going to consider 
what happens when we condition on the event that S„ = /c„ + £ for some 
i > 0, and later sum over i. It will be important for us to know the relevant 
range of i to sum over. In particular, for large enough i we expect that the 
probability P(S„ = kn + £) will be so small, that these i will not influence the 
conditional distribution of the vector Xn in an essential way. The relevant 
range of i can be determined from the following lemma: 

Lemma 3.3. For all positive integers s, 

P(S„ >K + 2Ms) < Mexp (_ ikn 'H^nH Ms) s ^^ ^^^^ ^ 

Proof. Let u be such that < m < (1 — pi)min- Observe that then, for all 
integers m such that Pirriin + u < m < rriin, 

PjXjn = m + 1) _ rriin - m pj ^ Pimjn - ^jz^ 
F{Xin = m) m+1 1-pi ~ Pirriin + u 

hence 

¥{Xin = m + 1) ^ ^ _ u f ^ p, \ < ^ _ < 1 _ !^ 
¥{Xin = m) ~ piiTLin + uy I - Pi ) ~ mj„~ n' 

Since 1 — z < exp(— z), by repeated application of this inequality it follows 
that for all M > and all positive integers t, if m is an integer such that 
m > Pirriin + u, then 

P{Xin = m + t) < exp (^-^^ P{Xin = m). (21) 

Now observe that if S„ > E(S„) + Mr + 2Ms, where s is a positive 
integer, and r a real number such that r + s > 0, then for some k it must be 
the case that X^n > Pk^kn + r + 2s. Therefore, 

P(S„ > E(S„) + Mr + 2Ms) 

M 

< ^^'^^'^" + ^ + 2^)^^^- = ^'^')• 

h+-+tM>E,{T..a)+Mr+2Ms 
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But by (21), taking u = r + s and t = s, 



1(4 > Pkrrikn + r + 2s) P{Xi„ = ii Vi) 

^1 



< exp ( -^^^^^1 PiXkn = 4 - s,X,n = l,yt^ k), 



and therefore 



P(E„ > E(S„) + Mr + 2Ms) 



< Mexp ( _(!L±^ ] p(5]„ > E(S„) + Mr + 2Ms - s) 



n 



< Mexp |^_ (^ + ^)^ ^ p(5]^ > E(S„) + Mr). 

Choosing r such that /c„ = E(S„) + Mr yields Lemma 3.3 (observe that the 
bound holds trivially if r + s < 0). □ 

Lemma 3.3 shows that if a > '^fliPiC^i, then for sufficiently large n, 
P(Sn > kn + i) will already be much smaller than P(S„ > A;„) when i 
is of order logn. However, when a = YlfLiPi'^iy '^^ need to consider i of 
bigger order than ^/n for P(S„ > kn + i) to become much smaller than 
P(E„ > /cjj). In either case. Lemma 3.3 shows that £ of larger order than ^yn 
become irrelevant. 

Keeping this in mind, we will now look at the conditional distribution of 
the vector Xn, conditioned on {S„ = A;„ + £}. The first thing to observe is 
that for i > 0, the locations of the centres around which the components Xj„ 
concentrate will be shifted to larger values. Indeed, these centres are located 
at cfj^niin, where the cf„ are of course determined by the system of equations 

^T^ = ^T^ Vz,jG{l,...,M}; 

To find an explicit expression for the size of the shift cf„ — Cj„, we can sub- 
stitute cf„ = + 5in into (22), and then perform an expansion in powers of 
the correction to guess this correction to first order. This procedure leads 
us to believe that must be of the form 

^in ~ '^in "I" Cm(l ~ Cin)d^ + Cj^, (23) 

where 



^y^j J—] (^jniX (^jn)^jn 



23 



and ef„ should be a higher-order correction. The following lemma shows that 
the error terms ef„ are indeed of second order in c?^, so that the effective 
shift in Cin by adding i extra successes to our Bernoulli variables is given by 
Cin(l — Cinjd^^. For convenience, we assume in the lemma that < 1/2, 
which means that \i\ cannot be too large, but by Lemma 3.3, this does not 
put too severe a restriction on the range of i we can consider later. 

Lemma 3.4. For all £ (positive or negative) such that |c/^| < 1/2, we have 
that \efj < «)2 for all i = 1, . . . , M . 

Proof. For ease of notation, write crj„ := Cj„(l — Cj„). As before, we write 

4n ^-Vi Cin + (Tindi + ef^ I - Pi 

for the desired common value for all i, so 

e _ Pi{^ ~ Cin ~ <^indn) — ^1(1 ~ Pi){Cin + Cind^) {0A\ 

Aiil-p,)+p, • ^ ^ 

As before, the value of is uniquely determined by the requirement that 
Y^fii 4n^in = kn + i. Sluce ^ifii CinTTiin = K and ^J^^ aj„d^mj„ = £, this 
requirement says that 

M 



^ e\nmin = 0. 



i=l 



In particular, the ef„ cannot be all positive or all negative, from which we 
derive, using (24), that must satisfy the double inequalities 

I Pi{^ Cin (^indn) \ ^ aP ^ f PiiX Cin (^indn) 

mm < -7- )■ < < max ' 



i=l,...,M [{1- Pi){Cin + aindi) j " i=l,...,M {{1 - Pi){Cin + aind^ 

A simple calculation establishes that 

Pi{l - Cin - (Tind^^) I -Cin Pi /, , "A (- (1 - Q„)(i^)'' 



[l-Pi) {Cin + CTindi) Cin 1 " F 



k=l 



from which (using < 1/2) we can conclude that 

i^^(l 1^^(1 - < + 2{dif), 

^in ^ Pi ^in ^ Pi 

since by (3), neither the lower bound nor the upper bound here depends on i. 
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Inserting the lower bound on into (24) gives 

1-(1-Q„K - 2^'^'^^ ' 

where in the last step we used that \dW < 1/2 and am < 1/4. Likewise, 
substituting the upper bound on into (24) yields 



, ^ ainjl + Qn)«)' + 2a,„(l - CmMf ^ 2a,Mf 
1-(1-qO< + 2(1-q„)«)2 - 1-1/2 



For future use, we state the following corollary: 

Corollary 3.5. If {kn — ^^f^^Cirriin)/ y/n K for some K E [—00,00], then 
/orze {1,...,M}, 

(Qji ^i)'^in Cj(l Ci)Oli 

^ — M 

Ej-l9(l-9)"i 

Remark 3.6. If (A;„ — E(S„))/-\/n — > G R, then a = J2iLiPi(^i we 
have Cj = pj for all i G {1, . . . ,M}. In this situation. Corollary 3.5 states 
that the vectors — Xn, and hence also the same vectors conditioned on 
{S„ > kn}, converge pointwise to the vector whose i-th component is 

Pi{l-Pi)ai ^ 



Proof of Corollary 3.5. First, suppose that G R. If £ = cimin — k„ 
and the cf„ satisfy (22), then cf„ = Cj. Hence, by Lemma 3.4, 

Ci - Cin = Q„(l - Cin)dl + 0(((i^)^), 

where 

_ J2i=l CifTlin — kn _ ^( -l/2\ 

This implies 



from which the result follows. 
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Next, suppose that = oo. Since Cin is increasing as a function of kn, 
we have by the first part of the proof 



hm inf 

n—^oo 



Ci)mi 



> 



n 



Ci{l - Ci)ai 



for all L G R. Hence, the left-hand side is equal to oo. The proof for the 
case = — oo is similar. □ 

When we condition on {E„ = + then in analogy with what we have 
done before, the natural scaled vector to consider would be the vector 



X 



In 



n 



n 



n 



where the components of the vector 



[X 



identify the centres 



around which the Xj„ concentrate. Here, the are nonnegative integers 

kn + t Note 



< 1 for all 2, and 



X A, 



chosen such that \x\._^ , ^.-^ 

that the vector AT^ is simply a translation of Xn by {x^^ — Xn) / \fn. Since 
Lemma 3.3 shows that if fc„ is sufficiently larger than E(S„), only values of I 
up to small order in n are relevant, the statement of Theorem 1.5 should not 
come as a surprise. To prove it, we need to refine the arguments we used to 
prove Theorem 1.4. 

Proof of Theorem 1.5. Assume that {kn — E(S„))/A/n — )■ oo, and let 

1/2 

2M 



n 



k„ - E(K 



Note that then a„ — > oo but an/y/n — > 0. Furthermore, Lemma 3.3 and a 
short computation show that 



P(S„ > kn + ar, 



0. 



■IP(^?i ^ kn 

It is easy to see that from this last fact it follows that 

sup P{Xn G v4 I S„ > kn) — P{Xn G A | A;„ < S„ < + tt^ 
A 



->o. 



where the supremum is over all Borel subsets A of R*^. It is therefore suf- 
ficient to consider the limiting distribution of the vector Xn conditioned on 
the event {kn < S„ < A;„ + a„}, rather than on the event {S„ > kn}. 
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(2a„ + 



Figure 2: We coarse-grain our densities by combining (2a„ + 1)^^ sheared 
cubes into larger sheared cubes. Here, we show this coarse-graining for M = 2 
and a„ = 2. The dots are the points in ((2a„ + / y/n. The combined 

sheared cubes have been coloured in a chessboard fashion as a visual aid. 

As in the proof of Theorem 1.4, for z G R*'^ we let = [zy^J, and we 
define the functions : R*^ — )• R by setting 

U{z) ■= {^Y'¥{^Xn = rl\K<T.n<kn + a„) . 

As before, this is a probability density function with respect to Lebesgue 
measure A on R*^, and if Zn is a random vector with this density, then 
the vector Z'^ = \Zn\/n\ / y/n has the same distribution as the vector Xn 
conditioned on the event {k^ < E„ < fc„ + a„}. Hence, it is enough to show 
that the weak limit of Zn has density /// f dfiQ with respect to /iq. 

An essential difference compared to the situation in Theorem 1.4, how- 
ever, is that the densities /„ are no longer supported by the collection of 
points z for which is in the hyperplane 5*0 (i.e. the union of those sheared 
cubes that intersect Sq). Rather, the support now encompasses all the 
points z for which is in any of the hyperplanes 

Se := {{zi, ...,ZM)eR^^: Zi + --- + ZM = i}, i = 0,l,..., a„, 

because if e Si, then the event {y/nXn = r^} is contained in the event 
{S„ = kn + i}- For this reason, the densities /„ are not so convenient to work 
with here. Instead, it is more convenient to "coarse-grain" our densities by 
spreading the mass over sheared cubes of volume ((2a„ + l)/^/n)^'^ rather 
than volume {l/^/n)^^, to the effect that all the mass is again contained in 
the collection of sheared (coarse-grained) cubes intersecting Sq. 

To this end, for given n we partition R*^ into the collection of sets 

|^(a + a-i(-a„ - l/2,a„ + l/2]^^): aG ((2a„ + 1)2)'^'}. (25) 
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See Figure 2. For a given point z G R*^, we denote by the sheared cube 
in this partition containing z. Now we can define the coarse-grained densities 

9n{z) := ^ \ , P(Ar„ ^Ql\K<T.n<kn\ a^) 



2a„ + 1 



M 

n 



2a„ + 1 



By construction, these are again probabihty density functions with respect 
to M-dimensional Lebesgue measure A. Moreover, each of these densities is 
supported on the collection of sheared cubes in (25) that intersect 5*0, and is 
constant on each sheared cube Q^. In particular, for any given point z e 
we have 

9n{y)dX{y)= / gn{y)dfXo{y)- 



n 



Finally, because an/y/n — )■ it is clear that if 2'^ has density gn, then its 
weak limit will coincide with that of 2n, and hence also with that of the 
vector Xn conditioned on the event {/c„ < S„ < + a„}. 
Suppose now that we could prove that 

^^^9n{z) ^ -/j^ for every z G R^. (26) 
Then it would follow from Fatou's lemma that for every open set A C R*^, 

liminf [ + ^ g^(^z) dfio{z) > ^A^^f') J^^^^) _ 
Ja J J ano 

By approximating the open set A by unions of sheared cubes contained in A, 
as in the proof of Theorem 1.4, it is then clear that this would imply that 



liminf / gn{z)dX{z) > ^ 

J A 



Ia /(^) dM^) 



It therefore only remains to establish (26). 

Since (26) holds by construction for z ^ 5*0, we only need to consider 
the case z G 5*0. So let us fix z G 5*0, and look at gn{z). By definition, 
this is just the rescaled conditional probability that the vector Xn lies in the 
sheared cube Q^, given that /Cn < < + c^n- In other words, if we define 
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Q := n and Q„ := Q n Si, then we have 



I ;^ — -r ) ^ P(yn Ar„ = r | /c„ < S„ < /c^ + a„) 

V/:a„ + iy ^^^^ 

P(v/^ Ar„ = r I S„ = A;„ + £) P(S„ = kn + 



2a„ + 



M a„ 



£=0 reC? 



P(^n ^ ^ kn ~\~ On) 



Since contains exactly (2a„ + l)*^ points, from this equahty we conclude 
that to prove (26), it is sufficient to show that 



sup sup 



r I S„ 



kn 



f fdfio 



0. (27) 



The proof of (27) proceeds along the same line as the proof of pointwise 
convergence in Theorem 1.4, based on Lemma 3.2. However, there is a catch: 
because we are now conditioning on T,n = kn + the Xj„ are no longer 
centred around Xj„, but around We therefore ffist write the conditional 
probabilities in a form analogous to what we had before, by using that 

P(v^Ar„ = r I S„ = fc„ + £) =P{^/^Xl = r + Xn-x'n \ S„ = fc„ + £). 

Writing := r + x„ — for convenience, we now want to study the ratios 



P(V^A'1 = I T.n = kn + 



P(v/^< = 0) 



n 



p(x. 



P{X,n = Xin) 

-- o{y/n), 



for i and r satisfying < £ < a„ and r E 

By equation (23) and Lemma 3.4 we have that sup^jxf^ — Xj„ 
from which it follows that also sup^^jr^ — Zy/n\ = o{y/n), where the suprema 
are over all £ G {0, . . . , a„} and r G C/„. Thus, by the ffist part of Lemma 3.2, 



sup sup 



P(, 




1 s„ 


— kn ~\~ ^) 


P(. 


/axl^ = 




= kn + i) 



->0, 



where we have used that for all terms concerned, Y\!iLi = 1 because 
E Sq. Furthermore, from the second part of Lemma 3.2 it follows that the 
functions 

P(y^<= \z^\ I S„ = A;„ + £) 
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are bounded uniformly in n and in all £ G {0, . . . , a„} by a /xo-integrable 
function. In the same way as in the proof of Theorem 1.4, it follows from 
these facts (with the addition that we have uniform bounds) that 



sup 



0. 



From this we conclude that (27) does hold, which completes the proof of 
Theorem 1.5. □ 



3.3 Proof of Theorem 1.6 

Proof of Theorem 1.6. Suppose that {kn — E(Sri))/A/n — )■ K for some K G 
[— oo, oo). Let X he a. random vector having a multivariate normal distribu- 
tion with density h/ j hdX with respect to A. By standard arguments, 
converges weakly to X. Therefore, for a rectangle A C R*^ we have 

P{xi G A > kn) = Pi^l eAn H,_^^^) ^F{XeAn Hk), 

since A fl Hk+e is a A-continuity set for all e G R. Taking A = R*^ gives 



P(S„ > kn) ^ Fix G Hk). 
Hence, for all rectangles A C 

FiXleA\En>kn)-.^^^^^. □ 



3.4 Law of large numbers 

Finally, we prove a law of large numbers, which we will need in Section 4. Let 
Xin denote a random variable with the conditional law of conditioned on 
the event {S„ > kn}. If (A;„ — E(S„))/A/n — )■ K for some K G [—oo, oo], then 
an immediate consequence of Theorems 1.5 and 1.6 is that Xin/n converges 
in probability to either piUi or Qaj. The following theorem shows that such 
a law of large numbers holds for a general sequence kn such that kn/n — )■ a. 

Theorem 3.7. For i G {1, . . . , M}, the random variable Xinjn converges in 
probability to piai if a < Yl^LiPiO^i, or to CiUi if a > Y^=\Vi(^i- 

Proof. If a 7^ '^ftiPiCa^ then (/c„ — E(S„))/A/ri goes to — oo or oo as n — oo, 
and the result immediately follows from Theorem 1.5 and Theorem 1.6. 



30 



Now suppose that a = "^i^iPiCa- Then q = pi for alH G {1, . . . , M}. 
Recall that in general the Cj and A are determined by the equations 



M 

Pi 1 PiO^i 

Ci = r and > r = a. 

Pi + A{1 - p,) ~tPi + ^(1 - Pi) 

The constant A is continuous as a function of a, hence Cj = Ci[a] is also 
continuous as a function of a. Therefore, if a = YliLiPi^^iy then for each 
e > we can choose 6 > such that Cj[a + 5]aj < Piai + \e. By Corollary 2.3 
we have, for large enough ra, 

^{Xin > {PiOli + e)n I S„ > kn) 

< P(Xm > {Pia^ + e)n | S„ > (a + 6)n) 

< P(Xi„ > {ci[a + S\ai + \e)n | S„ > (a + 5)n), 

which tends to as n — )■ oo by Theorem 1.5. Similarly, using Corollary 2.3 
and Theorem 1.6 instead of Theorem 1.5, we obtain 

^{Xin < {piai - 6)n I S„ > kn) -> 0. 

We conclude that Xin/n converges in probability to piUi = CiUi. □ 



4 Asymptotic stochastic domination 
4.1 Proof of Theorem 1.8 

Consider the general framework for vectors Xn and Yn of Section 1.2 in 
the setting of Section 1.4. We will split the proof of Theorem 1.8 into four 
lemmas. In the statements of these lemmas, we will need the constant a, 
which is defined as the limit as n — ?■ oo of kn/n: 

M M 

PifTlin , Pi'^i 



1 \ ^ Pilll^in , , \ ^ 

kn = / n 7 T , hence a = > 

Pi + Pmax(l -Pi) Pi + Pmax(l - Pi) 

Let us first look at the definition of a in more detail. In Section 1.4, we 
informally introduced the sequence kn as a critical sequence such that if kn is 
around kn, then there exists a block i such that the number of successes Xj„ 
of the vector X„ in block i is roughly the same as Yin- We will now make 
this precise. Recall that the Cj and the constant A are determined by 

M 

Pi 1 \- PiO^i 

a. 



and 



Pi + A{l-p,) j^pi + A{l-pi) 
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Furthermore, note that 

Pi ^ 
Pi + /3i{l-pi) 

and recall that we defined / = {«€{!,..., M} : /3j = /3max}- The ordering of 
a and a gives information about the ordering of the Cj and g^. This is stated 
in the following remark, which follows from the equations above. 

Remark 4.1. We have the following: 

(i) If a < a, then A > /3max and q < g, for alH G {1, . . . , M}. 

(ii) If a = d, then A = f3inax and q = qi for i G /, while q < qi for i ^ J. 

(iii) If a > d, then A < f3^a_x and q > for some z G {1, . . . , M}. 

(iv) Ej=i Pi^^i < a < Zlj=i with a = Y^fi^ PiUi if and only if /^^ax = 1, 
and a = J2i'Li li^^i if and only if all (3i G {1, . . . , M}) are equal. 

Our law of large numbers. Theorem 3.7, states that Xin/n converges in 
probability to piOi if a < J2iLi PiOa, and to CiUi if a > PiCti- This law of 
large numbers applies analogously to the vector 1^. If we define di, . . . , cIm 
as the unique solution of the system 

= V„e{l,...,M}, 

di 1 - qi dj 1 - qj 

2^i=i diUi = a, 



then Yin/n converges in probability to gjCtj if a < J^fLili^^iy and to d, 



if a > YliLili^^i- These laws of large numbers and the observations in 
Remark 4.1 will play a crucial role in the proofs in this section. 

Now we define one-dimensional (possibly degenerate) distribution func- 
tions Fk : R — )• [0, 1] for K G [—oo, oo], which will come up in the proofs as 
the distribution functions of the limit of a certain function of the vectors Xn- 
Recall from Section 1.3 the definitions (5), (6), (7) and (8) of the measure uq, 
the functions / and h and the half-space Hk- Write u = {ui, . . . ,um)- Then 

if A < oo, a = 



FKiz) 



frv- ^ 1 fiu)diyn(u) 2H) 
jr— It K < OO, a > l^^^^piai, 

J J "i^O 

if K = cx), 
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where 

= ^^/''^'-'f' K. (29) 

The following lemmas, together with Proposition 1.3, imply Theorem 1.8. 
Lemma 4.2. If a < a, then supP(X„ < Yn) 1. 

Lemma 4.3. Suppose that a > a and Pi ^ /3j for some i,j G {1, . . . , M}. 
Then supP(X„ < Yn) ^ 0. 

Lemma 4.4. Suppose that a = a and Pi ^ Pj for some i,j G {1, . . . , M}. 
Suppose furthermore that [kn — kn)/ \/n — )■ K for some K G [—oo, oo]. Then 
supP(X„ < Yn) ^ in{,^^FK{z) - $(z/a) + 1. 

Lemma 4.5. If a = a and Pi ^ Pj for some i,j G {1, . . . , M}, then 

1 if K = — oo, 

inf Fk{z) - $(z/a) + 1 = Ip^ ifKeR, where < Px < 1, 
zeR I 

ifK = oo. 

The constant a in Lemma 4.4 is the constant defined in (9a). The infimum 
in Lemma 4.4 can actually be computed, as Lemma 4.5 states, and attains 
the values stated in Theorem 1.8, with as defined in (10). 

We will prove Theorem 1.8 by proving each of the Lemmas 4.2-4.5 in 
turn. The idea behind the proof of Lemma 4.2 is as follows. If we do not 
condition at all, then X„ < Yn for every n > 1. If a < X]f=iPiO^«5 then 
the effect of conditioning vanishes in the limit and supP(X„ < Yn) — ?■ 1 
as n — )■ cxD. If J2iLiPi(^i < a < a, then q < for all i G {1,...,M}. 
Hence, for large n we have that Xin is significantly smaller than Yin for all 
2 G {1, . . . , M}, from which it will again follow that supP(X„ < Yn) — )■ 1. 

Proof of Lemma 4-3- First, suppose that a < "^itiPiC^i- Let X„ and Yn be 
defined on a common probability space {Q, J-", P) such that X„ < 1^ on all 
of Q. Pick ui E Q according to the measure P( ■ | ^*li^m > kn) and pick 
W2 G independently according to the measure P{-\EZiy^n>kn). If UJ2 
is in the event {^j=i-^m > ^n} £ J^, set Yn{ijji,ijj2) := l^(co'i), otherwise 
set Yn{uji,uj2) ■= Yn{u2). Set Xn{uJi,U2) := Xn{uJi) regardless of the value 
of U2. It is easy to see that this defines a coupling of X„ and Yn on the space 
{Q X Q, X J^) with the correct marginals for Xn and Yn. Moreover, in this 
coupling we have Xn < 1^ at least if ijJ2 G {J2iLi^in > ^n}- Hence 

SUpP(X„ < Yn) > Z^^'k^'"-^^^ 



P(y^j— 1 Yin ^ kn) 
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which tends to 1 as n — )■ oo (e.g. by Chebyshev's inequahty). 

Secondly, suppose that J^fLiPi^^i < a < d. By Remark 4.1(i), Cj < for 
alH G {1, . . . , M}. For each couphng of X„ and Yn we have 

P(X„ < Yn) > P(X,„ < (q + gOa^'^/S < Vz G {1, . . . , M}), 

which tends to 1 as n — )■ oo by Theorem 3.7 and Remark 4.1(iv). □ 

The next lemma, Lemma 4.3, treats the case a > a. In this case, we have 
that for large n, Xj„ is significantly larger than Yin for some i G {1, . . . , M}, 
from which it follows that supP(X„ < Y^) — )■ 0. 

Proof of Lemma 4-3- First, suppose that a < a < Yl!i!=i1i^i- Then Cj > 
for some i G {1,...,M} by Remark 4.1(iii). Hence, by Theorem 3.7 and 
Remark 4.1(iv), 

P(X,„ > (ci + qi)ainl2) 1, 

It follows that P(X„ < Yn) tends to uniformly over all couplings. 

Next, suppose that a > Yl^i li'^i l^i f^j fo'^ some i, j G {1, . . . , M}. 
Then there exists i G {1, . . . , M} such that Ci di, since 

l^-jk_dj_ _ l-qi Pj _ n ^-Cj Cj 
dj 1 - dj qi 1 - Q 1 - Cj 

In fact, we must have q > (ij for some i G {1, . . . , M}, because CjOi = 
X]i=i '^jO^j- By Theorem 3.7, it follows that 

P(X„ > (q + di)ain/2) ^ 1, 
P(n« > (q + c?i)airi/2) ^ 0. 

Again, P(X„ < 1^) tends to uniformly over all couplings. □ 

We now turn to the proof of Lemma 4.4. Under the assumptions of this 
lemma, Cj = q^ for i E I and Cj < qi for i ^ I. The proof proceeds in four 
steps. In step 1, we show that the blocks i ^ I do not influence the asymptotic 
behaviour of supP(X„ < Yn), because for these blocks, Xin is significantly 
smaller than Yin for large n. In step 2, we show that the parts of the vectors 
Xn and Yn that correspond to the blocks i E I are stochastically ordered, 
if and only if the total numbers of successes in these parts of the vectors 
are stochastically ordered. At this stage, the original problem of stochastic 
ordering of random vectors has been reduced to a problem of stochastic 
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ordering of random variables. In step 3, we use our central limit theorems 
to deduce the asymptotic behaviour of the total numbers of successes in the 
blocks i E I. In step 4, we apply the following lemma, which follows from [6, 
Proposition 1], to these total numbers of successes: 

Lemma 4.6. Let X and Y be random variables with distribution functions 
F and G respectively. Then we have 

supP(X <Y) = inf F(z) - G(z) + 1, 

where the supremum is taken over all possible couplings of X and Y. 

Proof of Lemma 4-4- Write m/„ := Ylii^i'^'^in- Let X/„ and Xin denote the 
m/n-dimensional subvectors of X„ and respectively, consisting of the 
components that belong to the blocks i E I. Define Yin and Yin analogously. 
Step 1. Note that for each coupling of X„ and Yn, 

V{Xn < Yn) > ¥{Xin < Y/n, Xin < (Q + g.)«.n/2 < Y^n ^ I) 
> F{Xin < Yin) - 

5^{p(x,„>^«,n)+p(K„<^a.n)}. (30) 

By Remark 4.1(ii), Cj < for i ^ I. Hence, it follows from Remark 4.1(iv) 
and Theorem 3.7 that the sum in (30) tends to as n — )■ oo, uniformly over 
all couplings. Since clearly supP(X„ < 1^) < supP(X/„ < !/„), 



SUpP(X„ < Yn) - SUpP(X/„ < Yin] 







where the suprema are taken over all possible couplings of (X„, Yn) and 
{Xin, Yin), respectively. 

Step 2. The /3j for i E I are all equal. Hence, by Proposition 1.2 and 
Lemma 2.2 we have for m G {0, 1, . . . , m/„} and i G {0, 1, . . . , rriin — m} 

CiXIn\J:^eI^in = m) ^C{Yin\j:iel'^in = m + i). (31) 

Now let B be any collection of vectors of length min with exactly m compo- 
nents equal to 1 and min — m components equal to 0. Then 

P{Xin eB)= F{Xin G B I E£l^.n > kn) 



FiXineB)F{j:^^,X,n>kr: 



m] 



PiEfilXin>kn) 
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Taking C to be the collection of all vectors in {0, l}"*^" with exactly m com- 
ponents equal to 1, we obtain 

P(X,„ G B I E^eIX^n = m) = ^1^^" ^ ^\ = P(X,„ e B I = 

and likewise for l/„ and Yjn- Hence, (31) is equivalent to 

C{Xjn\ Y^idi^in = m) ^ C{Yin\ J2iei Yin = m + i) . 
With a similar argument as in the proof of Proposition 1.3, it follows that 

SUpP(X/„ < Yin) = SUpP(E.6/^m < EieI^^n)■ 

Step 3. First observe that by Remark 4.1(iv), a < J^fii^i'^i- Hence, by 
Theorem 1.6 (note that (fc„ — ^{J2i=i '^in))/\/n —oo) and the continuous 
mapping theorem, 

P(Eie/(^m - qim^n)/Vn < z) $(2/0) for every zeR. (32) 

Next observe that by Remark 4.1(ii), q = qi for z G / and A = /3max, from 
which it follows that A;„ = Xli^i '^i'^in- Hence, Corollary 3.5 gives 

Y^idli^in - qi)min/ y/n Zr, (33) 

with zk as defined in (29). In the case a > J^iLiPi^^iy Theorem 1.5, (33) 
and the continuous mapping theorem now immediately imply 

P(Eie/(^m - qimin)/Vn < z) ^ Fk{z) for every z eR. (34) 

Note that if = ±oo, is degenerate in this case: we have Fx{z) = 1 for 
all z G P if -ft' = -oo and Fk{z) = for all z G P if = oo. 

Now consider the case a = J^fLiPiO^i- By Remark 4.1(iv), in this case 
we have /3max = 1, which implies that A;„ = YldLiPi^in = E;(S„) and pi = qi 
for all i G {1,...,M}. Hence, if K = oo, then (33) and Theorem 1.5 
again imply (34) with Fx{z) = everywhere. If iiT G [—00,00), then we 
obtain (34) directly from Theorem 1.6; Fk is non-degenerate in this case 
(also for K = —00). 

Step 4. The distribution functions on the left-hand sides of (32) and (34) 
are non-decreasing and bounded between and 1, hence they converge uni- 
formly on compact sets. It follows by Lemma 4.6 that 

supP(Eie/^- < Eiei^in) ^ mi,^^FK{z) - <^{z/a) + 1. □ 
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Finally, we turn to the proof of Lemma 4.5. The key to computing the 
infimum of FK{z) — ^{z/a) + l is to first express the distribution function F^, 
defined in (28), in a simpler form. 

Proof of Lemma 4-5. In the case a > X]f=iP«'^« ^ ~ ~oo. Fx is 1 
everywhere, hence inf^gjR Fft:(2) — $(z/a) + 1 = 1. In the case = oo, 
is everywhere, hence inf^gjR ^^-(2;) — $(z/a) + 1 = 0. We will now study 
the remaining cases. 

Consider the case a = a = J^fLiPiOa and K G [—00,00). Let Z = 
{Zi, . . . , Zm) be a random vector which has the multivariate normal distri- 
bution with density h/ J hdX. By Remark 4.1(iv) we have /3max = 1- Note 
that therefore, ^ Xlie/ I ■^i a^*i \ l^i^i -^i' with a, h and c as defined 
in (9), all have the standard normal distribution. Moreover, Xlie/ ^"^^ 
Yli^ independent. 

For K = —00, it follows that Fk{z) = ^{z/a), hence inf^giR Fft-(z) — 
$(z/a) + 1 = 1. For K E H, observe that Z G Hx is equivalent with 
c Si^i — ^1 ^- Likewise, Z e Hk H {u G R*''^ : Xlie/ — ^} equivalent 
with I Y^-^j Zi< z/a and I Y^.^j Z^ > {K - Y^.^j Zi)/h. It follows that 



u) 



Fk(z) 



hdX J hdX 



hence 



^/r^ / \ I I / — / — dvdu 
1 - $(ir/c) Jk^ y27r y27r 

/•z/a -«2/2 $ ('Z^'l _ $ (K-au\ 

Fk{z) - $(./a) = y ^ ^ l-$(i^)' ^^^^ 

Clearly, the derivative of this expression with respect to 2; is if and only if 
{K — z)/h = Kjc, that is, z = z^i^^ = K — bK/c. Plugging this value for z 
into (35) shows that inf^giR Fx(-2) — ^{z/a) + 1 = Pk, with Pk as defined 
in (10). Moreover, Pk > because Fxizmm) > 0, and Pk < 1 because the 
integrand in (35) is negative for u < z^-y^ja. 

Finally, consider the case a > YfLiPi^^i K E H. This time, let 
Z = {Zi, . . . , Zm) be a random vector which has the singular multivariate 
normal distribution with density /// f duo with respect to uq. Then a little 
computation shows that {Zi, . . . , Zm-i) has a multivariate normal distribu- 
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tion with mean and a covariance matrix S given by 



M 2 



= : for ^ G {1, . . . , M - 1}, 



EM 
k=l 



a 



k 



S., = \ for t,je{l,...,M-l} with t ^ J, 



ElVl 
k=l^ 



k 



where af = Cj(l — Cj)aj for i G {1, . . . , M}. Similarly, every subvector of Z 
of dimension less than M has a multivariate normal distribution. 

By the definition (28) of Fx, zk + ^jg/ Zi has distribution function Fk- 
Since /3j 7^ /3j for some z, j G {1, . . . , M}, we have |/| < M — 1. It follows 
that Xlie/ ^« ^ normal distribution with mean and variance 

ie/ 2^k=i^k iei ^k=i^k z^i=i^i 

By Remark 4.1(ii), A = /3max and hence for i G {1, . . . , M}, 



It follows that the variance (36) is equal to a^6^/c^, with a, b, and c as defined 
in (9). Furthermore, zk = a?K/(?. We conclude that Fx is the distribution 
function of a normally distributed random variable with mean a^Kj(? and 
variance d^\? so that Fk{z) = ^ [-^{z — a'^ K / c^)) . Since a'^b^/c^ < a^, we 
see that Fk{z) < $(z/a) for small enough z. Hence Fk{z) — ^{z/a) attains 
a minimum value which is strictly smaller than 0. This minimum is strictly 
larger than —1 because Fk{z) > for all z G R. 

To find the minimum, we compute the derivative of Fk{z) — ^{z/a) with 
respect to z. It is not difficult to verify that the minimum is attained for 

z = z^^ = K- ^-y^K^ + cHogicyb^), 

from which it follows that inf^gjR ^^-(z) — $(z/a) + 1 = Pk, with Pk as 
defined in (10). From the remarks above we know that < Pk < 1- □ 



4.2 Conditioning on exactly kn successes 

For the sake of completeness, we finally treat the case of conditioning on 
the total number of successes being equal to kn- The situation is not very 
interesting here. 
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Theorem 4.7. Let X„ be a random vector having the conditional distribution 
of Xn, conditioned on the event {S„ = A;„}. Define Yn similarly. If all f3i 
(i G {1, . . . , M} ) are equal, then X„ and Yn have the same distribution for 
every n > 1. Otherwise, supP(X„ = Yn) — )■ as n oo. 

Proof. If all Pi (i G {1, . . . , M}) are equal, then by Proposition 1.2 we have 
that Xn and Yn have the same distribution for every n > 1. If /3j 7^ /3j for 
some i,j G {1,...,M}, then it can be shown that supP(X„ < Yn) — )■ 
as n — 00, by a similar argument as in the proof of Lemma 4.3; instead of 
Theorem 3.7 use Lemma 3.L □ 
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